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When considering d possibly dependent random variables, one is 
often interested in extreme risk regions, with very small probability p. 
We consider risk regions of the form {z € M d : /(z) < ft}, where / 
is the joint density and ft a small number. Estimation of such an 
extreme risk region is difficult since it contains hardly any or no data. 
Using extreme value theory, we construct a natural estimator of an 
extreme risk region and prove a refined form of consistency, given a 
random sample of multivariate regularly varying random vectors. In 
a detailed simulation and comparison study, the good performance 
of the procedure is demonstrated. We also apply our estimator to 
financial data. 

1. Introduction. A two-dimensional normal density or Student t-density 
is constant on boundaries of certain ellipses. Outside such an ellipse the den- 
sity is lower than inside. It is straightforward to find such an outer region and 
its contour (line), for a given small probability. We can consider such con- 
tour as a natural multidimensional extension of a (one-dimensional) quantile. 
Even for extreme sets, that is, very low density levels, the calculations are 
straightforward. 

In this paper we consider, much more general, multivariate regularly vary- 
ing distributions [for a review, see Jessen and Mikosch (2006)]. We consider 
the latter distributions, since we want to explore in particular extreme sets, 
that is, sets far removed from the origin. A random vector X is multivariate 
regularly varying if there exist a constant a > 0, the index and an arbitrary 
probability measure f on9 = {zgR !i : ||z|| = 1}, the unit hypersphere, such 
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that 

V ' t^oo P(||X|| >t) ^ J 

for every x > and Borel set A in with fy(dA) = 0, with ||X|| the L2-norm 
of X; see Rvaceva (1962). An equivalent statement is 

(2) Itm forx>0, 

V ' t^OO P(||X|| >t) 

and there exists a measure v such that 

for every Borel set B on M. that is bounded away from the origin and satisfies 
v(dB) = 0; here tB = {tz : z G B}. Note that ^ is homogeneous, that is, for 
all a > 0, 

(4) i/(o5) = aT a v(B). 

Clearly, on {z G R d : ||z|| > 1}, z/ is a probability measure. The limit relation in 

(3) is a multivariate analogue of the "peaks-over-threshold" or "generalized 
Pareto limit" method in one-dimensional extreme value theory. Particular 
cases of (1) are distributions in the sum-domain of attraction of a-stable 
distributions and heavy tailed elliptical distributions such as multivariate 
t-distributions [see Hashorva (2006)]. 

We require the convergence in (2) and (3) at the density level: 

(a) Suppose that the distribution of X has a continuous and positive 
density / and that for some positive function q and some positive function 

V regularly varying at infinity with negative index —a, we have 

(5) lim f ^ Z } . = q(z) for all z ^ 



and 

f(tz) 



(6) lim sup 



q(z) 



t~ d V(t) 

Then q is continuous on M. d \ {0} and q(az) = a~ d ~ a q(z) for all a > and 
z/0. Throughout, we can and will take V{t) = P(||X|| > t) (see Lemma 1, 
Section 5). From Lemma 1, it follows that doing so (3) holds with v{B) = 
f B q(z)dz. 

The extreme region will be of the form 

Q = {zGM d :/(z)</3}, 
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where / is the probability density of the random vector X; j3 is determined 
in such a way that the probability of Q is equal to a given very small number 
p, like 1/10,000. 

It is the purpose of this paper to estimate Q based on n i.i.d. copies of X. 
Note that the shape of Q is not predetermined, it depends on the density /. 
For the estimation of Q, we will use an approximation of / based on the 
density of The values of p we consider are typically of order 1/n. This 
means that the number of data points that fall in Q is small and can even be 
zero, that is, we are extrapolating outside the sample. This lack of relevant 
data points makes estimation difficult. The estimation of Q is a multivariate 
analogue of the estimation of extreme quantiles in the univariate setting; 
see, for example, de Haan and Ferreira (2006), Chapter 4. The multivariate 
case is much more complicated, however, since we have to estimate a whole 
set instead of only one value. 

Having an estimate of Q can be important in various settings. It can be 
used as an alarm system in risk management: if a new observation falls in 
the estimated Q it is a signal of extreme risk. See Einmahl, Li and Liu 
(2009) for an application to aviation safety along these lines. In a financial 
or insurance setting, points on the boundary of the estimate of Q can be 
used for stress testing. The estimate of Q can also be used to rank extreme 
observations (see Remark 3, Section 2). 

For the "central" part of the distribution, that is, /3 is fixed (and "not 
too small"), nonparametric estimation of density level sets has been studied 
in depth in the literature. Two approaches are used, the plug-in approach 
using density estimation [see Bafllo, Cuesta-Albertos and Cuevas (2001) 
and Rigollet and Vert (2009)], and the excess mass approach [see Miiller 
and Sawitzki (1991), Polonik (1995) and Tsybakov (1997)]. Our estimation 
problem and (hence) our approach are quite different from these. 

This paper is organized as follows. In Section 2, we derive our estimator 
and show a refined form of consistency. A simulation and comparison study 
is presented in Section 3 and a financial application is given in Section 4. 
Section 5 contains the proof of the main result. 

2. Main result. Consider a random sample Xi, X2, . . . , X n with Xj = X, 
for i = 1, . . . , n; their common probability measure on W d is denoted with P. 
Write Ri for the radius ||Xj|| and Wj for the direction Xj/||Xj|| of Xj. We 
wish to estimate an extreme risk region of the form 

Q = {zGR d :f(z)<f3}, 

where f3 is such that PQ = p > 0, where p = p n — > 0, as n— > 00. This means 
that Q and /3 depend on n, that is, Q = Q n and j3 = j3 n . We shall connect 
Q n to a fixed set S not depending on n, defined by 

S = {z:q{z)<\}. 
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It will turn out that Q n can be approximated by a properly inflated version 
of S. In fact, it follows from (6) that the risk regions are asymptotically 
homothetic as a function of p, for small values of p. Define H(s) = 1 — V(s) = 
P(i? < s) and U(t) = iT _1 (l — 4). Note that U is regularly varying at infinity 
with index 1/a. 

We will approximate Q n in two steps by a (deterministic) region Q n . This 
approximation satisfies 

(7) PjQrAQn) ;Q 

p 

(A denotes "symmetric difference") and is based on the above limit relations. 
The region Q n can therefore be estimated using extreme value theory. The 
first step is to establish an approximation of (3 = /3{p). Let 

(b) k = k n (< n) be a sequence of positive integers such that k — > oo and 
k/n -> 0. 



The region Q n is approximated by 

l+d/a 

Q n = {z:f(z)< ' 



np 



kv{S)J {n/k){U(n/k)) d J' 

Next, we approximate Q n by a further region Q n defined in terms of the 
limit density q rather than /: 

Indeed, S and this approximation of Q n are homothetic. 
Write 

B r>A = {z : \\z\\ > r, z/\\z\\ G A} 

for a Borel set A on 0. Clearly, -B r) A = rB\^ and hence v{B T) a) = r -c V(i?i ]J 4). 
The relation between the spectral measure VP and i/ is [cf. (1) and (3)] 

= 1/(51,4) 

for a Borel set Ac 8. Recall that the spectral measure is a probability 
measure. The existence of a density qoiv implies the existence of a density 
ip of ^, that is, 



tf(A)= / V(w)dA(w), 



where A is the Hausdorff measure (surface area) on G and 

g(rw) = ar~ a ~ d ^{w). 
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Next, we write S and v(S) in terms of the spectral density: 
S = {z = rw:r> (cri/>(w)) 1/(a+c ° , w G 9} 

and hence 

v (S)= a ~ a/{a+d) [ (V>(w)) d/(a+d) dA(w). 
Je 

To estimate Q n , we need estimators for U(n/k),a,S and v(S). From the 
above expressions for S and v(S), we see that this means that we have to 
estimate U(n/k), a and tp. First, we define 

— J = Rn-k-.n 

[the (n — fe)th order statistic of the Ri, i = 1, . . . ,n]. Since the tail of the 
distribution function of R is regularly varying with index —a, we can use 
one of the well-known estimators of the extreme value index 1/a, based on 
the Ri,i = 1, . . . , n; see, for example, Hill (1975), Smith (1987) and Dekkers, 
Einmahl and de Haan (1989). It remains to estimate ip. Let K : [0, 1] — > [0, 1] 
be a continuous and nonincreasing (kernel) function with K(0) = 1 and 
K(l) = 0. For w € O, define an estimator of ^(w) by 

V-n(w) = — 1—2^ K 
i=l 



= {vG6:w T v>l-/i}; 
Hence, in view of (8), 

with 

S = {z = rw:r>(a|„(w)) 1 /( W ',we8} 

and 

^ = a - S /(S+d) I $ n ( w ))d/(a+d) dX ( w y 

Je 

In the definition of the set S, the choice of the value 1 was not motivated. We 
could have taken any number c > instead. Such an alternative definition of 



1 - w J W 

h 



[Ri>R n 



with < h < 1 and 

c(h, K)=(J c h K ( 1 ~ dA ( v )) . C ^ h )~- 

cf. Hall, Watson and Cabrera (1987). 

For estimating Q n it suffices to estimate Q n , see (7). 
we define 



G 



J.-J. CAI, J. H. J. EINMAHL AND L. DE HAAN 



S would lead to exactly the same estimator Q n , which shows that the value 
1 plays no role. 
Assume 

(c) lim — = c for some c 6 (0, oo). 

t— >oo t l a 

Note that this simple condition is weaker than the usual second order condi- 
tion with negative second order parameter p [see, e.g., Theorem 4.3.8 in de 
Haan and Ferreira (2006)]; indeed, there exist functions U with p = that 
satisfy condition (c). 

Theorem 1. Letp^-0 as n— >oo. Assume conditions (a), (b), (c) hold 
and that a is such that \fk{a — a) = Op(l). Also assume that (lognp) / Vk — >■ 
0, h — > and k/(c(h, K) log k) — > oo, as n — > oo. Then we have 

(10) > as n — »■ oo, 

P 

and hence 

P(Qn) 



1. 



P 



Remark 1 . The tuning parameter k is used in the estimators of a, U(n/k) 
and ip. It is important to be able to choose three different values for k, 
denoted with k a ,kjj and k^, respectively. (Note that "good" values of k a 
and kjj are determined by the tail of H — the distribution function of R\ — 
whereas a good k^ is determined by the conditional distribution of Wi, 
given that R% > r, for large r.) If we adapt the conditions of the the- 
orem, in particular if (b) holds for k a ,ku,k^ and if (\ognp)/y/k^ — > 0, 
k^/(c(h, K) log kjf,) — > oo and (log ku)/y/k^ — > 0, then (10) remains true for 
the generalized estimator that allows for the aforementioned different de- 
values. We will use this generalized estimator in the simulation study and 
the real data application. 

The actual choice of these /c-values is a notorious problem in extreme 
value theory. A solution of this problem is far beyond the scope of the 
present paper. We will only give heuristic guidelines here. First, consider 
the estimation of a. Plot 2 as a function of k. Now find the first stable, 
that is, approximately constant, region in the graph of this function. This 
vertical level is the final estimate of a. It is also possible to use (complicated) 
asymptotically optimal procedures; see, for example, Danielsson et al. (2001). 
Once the estimate a is fixed, we plot U{Jt )(^) 1 ^ a against k and we search 
for the first stable part in this graph. The vertical level is now the estimate 
of the constant c in condition (c). Observe that U(^){^) l l a is a building 
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block of Q n , so we do not need to estimate separately. Also observe 

that we do not (need to) determine k a and kjj, but only a region of good 

values. Finally, using again the already fixed a, we plot v(S) as a function 
of k and again we search for the first stable region; we take k^ to be the 
midpoint of this region of /c-values. 

Remark 2. The class of multivariate regularly varying distributions 
is quite large. It contains, for example, all elliptical distributions with a 
heavy tailed radial distribution and all distributions in the domain of a 
sum-attraction of a multivariate (nonnormal) stable distribution. It seems 
natural, however, to try to extend the assumption of multivariate regular 
variation to the case of nonequal tail indices a. It is an important feature 
of the present model that all directions are equally important: the marginal 
distributions do not play a special role. An extension to nonequal tail indices 
would be possible in principle, but it will be of limited value since it only 
works if marginal transformations lead to the present model. Also note that 
basically all linear combinations of the components inherit the lowest of the 
marginal tail indices: the tail index is not a smooth function of the direction 
(if it is not constant). Moreover, the statistical theory that will be needed 
will be challenging and will lead to a new and different project. 

Remark 3. Note that the estimated extreme risk region Q n = Q n (p) 
depends on p in a continuous way and has the property that p\ < p2 implies 
Qn{p\) C Q n {P2)- Hence, we can find the smallest p such that an observation 
is on the boundary of Q n {p)- The corresponding observation can be consid- 
ered the largest one and we know its "p-value." This is helpful in deciding 
whether some observation is the most extreme or if it is an outlier. Also, by 
continuing this procedure we can rank the larger observations. 

3. Simulation study. In this section, a detailed simulation study is per- 
formed in order to investigate the finite sample performance of our estimator 
[with 1/a estimated using the moment estimator of Dekkers, Einmahl and 
de Haan (1989) and with K(u) = 1 — u]. We consider five multivariate dis- 
tributions. 

• The bivariate Cauchy distribution with density 

This is a very heavy tailed density, with a = 1 and = l/(2-7r), for 

w G 6. 

• The trivariate Cauchy distribution with density 

(12) f{x,y,z) = 1 (x,y,z)eR 3 . 

7T Z (1 + x z + y z + z ) 
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This is also a very heavy tailed density, with a = 1 and 
for w £ 0. 

A bivariate elliptical distribution with density (ro ~ 1.2481) 
3 : r 4 (l + r ( 



1/(4tt) 



(13) 



f(x,y) 



Jh-3/2 



47T 



'0, 



x 2 /4 + y 2 < r ( 2 



0' 



2\2 



3(x 2 /4 + y 2 ) 



k 47r(l + (x 2 /4 + y 2 ) 3 ) 3 / 2 ' 



x 2 /4 + y 2 > 7-q. 



It is less heavy tailed. We have a = 3 and ip(wi,W2) = c(l + 3w 2 ) ' , w 
(wi,w 2 )ee, with cw 0.6028. 

A bivariate "clover" distribution with density (ro ~ 1.2481) 



(14) 



f(x,y) 



10vr 



5 + 



ro(x 2 + y 2 ) 3 / 2 



x 2 + y 2 <r 2 , 

,2\2 



3(9(x 2 + y 2 ) 2 -32x 2 y 2 ) 
10vr(l + (x 2 + y 2 ) 3 ) 3 / 2 
x 2 + y 2 > Tq. 



3, again, and ip(w\,W2) = (9 



We have a 

w 2 ) e 6. 

A bivariate asymmetric shifted distribution with density [ro 
f(x, y) := r V ((x + 5) 2 + y 2 ) 1 / 2 ] 



32iofu;|)/(107r),w = (wt, 
1.2331, 



(15) /(x,y) 



^2 



6vr(l +f 4 (x,y)) 5 /4 

^(^,y) 

k 6vr(l +f 4 (x,y)) 5 /4 



3 + 



3 + 



x + 5 
f(x,y) 

(x + 5) 3 



3(x + 5)y 2 



x,y) 



y>0, 



y<0. 



This distribution is not symmetric and the "center" is not the origin, but 
(—5,0); a = 1 and -0(^1,^2) = g^(3 + w{), if W2 > 0, and ijj(wi,W2) = 
^(3 + 4^-3^1), if w 2 <0, w=(w 1 ,w 2 )e&. 

First, we simulated single data sets of size 5,000 of the bivariate Cauchy 
distribution, the elliptical distribution in (13), the clover distribution in (14) 
and the asymmetric shifted distribution in (15). We computed the true and 
estimated risk regions for p = 1/2,000, 1/5,000 or 1/10,000. This is depicted 
in Figure 1 . We see that the estimated regions are relatively close to the true 
risk regions. It is interesting to note that the p-value (see Remark 3) of the 
largest observation for the Cauchy sample is 0.000209, which is about 1/n. 
This shows that this observation is a typical one. (Looking at the data only, 
one might want to conclude that this observation is an outlier.) Also note 
that for the bivariate Cauchy distribution, for, for example, p= 1/10,000, 
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Cauchy Density, n=5000, p=1/2000, 1/5000, 1/10000 Elliptical Density, n=SO00, p=1«000, 1/10000 




Fig. 1. True and estimated risk regions based on one sample of size 5,000 from the 
bivariate Cauchy distribution, the elliptical distribution in (13), the clover distribution in 
(14) and the asymmetric shifted distribution in (15). 



the density / at the boundary of the true risk region is less than 10~ 12 . This 
emphasizes that we are estimating in an "almost empty" part of the plane 
and that a fully nonparametric procedure could not work here. 

In addition, we simulated one sample of the bivariate distribution with 
independent t3-components. This distribution does not satisfy condition (a), 
since the spectral measure is discrete and concentrated on the intersection 
of the coordinate axes with the unit circle. We also simulated one sample 
of a bivariate "logarithmic" distribution with a = 1 and uniform spectral 
measure, but where the radial distribution satisfies J7 (£)/(£ log i) tends to a 
constant and hence U(t) jt — > 00 as t — > 00, that is, this distribution does not 
satisfy condition (c). Although both distributions do not satisfy our condi- 
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Independent Components, n=5000, p=1/5000 Logarithmic, n=S0O0, p=1/5000 




i 1 1 1 1 1 1 1 1 r 



-40 -20 20 40 -200000 -100000 100000 200000 

Fig. 2. True and estimated risk regions based on one sample of size 5,000 from the 
bivariate distribution with independent tz-components and the "logarithmic" distribution. 

tions, we see nevertheless satisfactory behavior of the estimator in Figure 2. 
In the left panel, the estimated region has about the right size and the dif- 
ficult shape is approximated reasonably well; in the right panel, we see that 
both the shape and the size are approximated quite well. 

After this visual assessment of our estimator based on one sample at a 
time, we now investigate its performance based on 100 simulated samples of 
size 5,000. We will compare our estimator (denoted EVT) to a nonparametric 
and to a more parametric estimator. The nonparametric estimator is only 
defined in case p = 1/n and tries to mimic the largest order statistic as 
an estimator of the (1 — l/n)th quantile in the univariate case. It aims 
at elliptical level sets. It is defined as follows. First, calculate the smallest 
ellipsoid containing half of the data, the so-called MVE. Then inflate this 
ellipsoid, such that the "largest" observation lies on its boundary. Now the 
region outside this ellipsoid is the estimator. 

For d = 2, the more parametric estimator is defined similarly to Q n in (9), 
but (only) the estimation of (v(S)) l l a S is done parametrically. Therefore, 
this estimator has the same size as Q n , but a different shape. (Note that 
the fully parametric estimator based on multivariate normality would have 
a very bad performance.) Take the k observations with radius Ri > R Tl -k- n 
and consider the transformed data (i?j/i? n _fc :n , Wj). In line with the limit 
result in (1), assume that these data have a "distribution" (•) -a \E', where \I/ 
depends on a parameter p. To be precise, we assume for the density 

ij) p {9) = (4vr)- 1 (2 + sin(2(0 - p))), < 6 < 2vr, < p < n. 
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Table 1_ 

Median of the relative errors P(Q„AQ n )/p of the three 
estimators, for p = 1/5,000 (pi) and 1/10,000 (p2) 



Density 


EVT pi 


Par pi 


NP pi 


EVT p2 


Par p2 


Biv. Cauchy 


0.28 


0.29 


0.72 


0.31 


0.32 


Triv. Cauchy 


0.22 




0.54 


0.24 




Elliptical 


0.36 


0.51 


0.80 


0.39 


0.54 


Clover 


0.44 


0.57 


0.58 


0.49 


0.61 


Asymm. shifted 


0.26 


0.27 


0.61 


0.30 


0.32 



(Here a point on the unit circle is represented by its angle € [0,27r).) Now 
a and p are estimated by maximum likelihood; observe that this yields the 
Hill estimator for 1/a. 

Table 1 shows for the three different estimators the median of the 100 
relative errors P(Q n AQ n )/p for p = 1/5,000 (pi) and 1/10,000 (p2). In Fig- 
ure 3, boxplots are shown of the relative error P(Q n AQ n )/p for p = 1/5,000 
(pi) and 1/10,000 (p2). From this table and figure, we see a good perfor- 
mance of our estimator. Its behavior does not change much if p changes 
from 1/5,000 to 1/10,000. The parametric estimator performs reasonably 
well, but it is outperformed by our estimator, in particular for the elliptical 
and clover densities. Recall that this estimator can be seen as a modifica- 
tion of our estimator, since it uses the same estimated inflation factor, but 
the shape is estimated differently. We see a moderate performance of the 
nonparametric estimator; also, it cannot be adapted to p= 1/10,000. Given 
that the estimation of these extreme risk regions is a statistically difficult 
problem, we see decent behavior of the three estimation methods. Obviously 
the parametric and the nonparametric estimator do not perform well if the 
parametric part of the model is not adequate or if the shape of the region is 
not elliptical, respectively. The EVT estimator, presented in this paper, does 
not suffer from these shortcomings and performs well for many multivariate 
distributions. 

4. Application. In this section, an application of our method to for- 
eign exchange rate data is presented. The data are the daily exchange 
rates of Yen-Dollar and Pound-Dollar from January 4, 1999 to July 31, 
2009. Consider the daily log returns given by X^j = log (Yj+ij/Yjj), with 
i = 1, . . . , 2,664, j = 1,2, and Yj i is the daily exchange rate of the Yen to the 
Dollar and Y,2 of the Pound to the Dollar. First, we check the equality of 
the extreme value indices (the reciprocals of the tail indices) of the right and 
left tails of both marginal distributions and that of the radius. This yields 5 
extreme value indices; the 5 estimates in increasing order are: 0.141, 0.191, 
0.223, 0.242, 0.256. Hence, the maximal difference is 0.115. Based on the 
asymptotic normality of the moment estimator of the extreme value index, 
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Bivariate Cauchy Density Trivariate Cauchy Density 



Par p1 NP p1 EVT p2 

Elliptical Density 



EVT p1 Par p1 



NPp1 

Clover Density 



~~r 



EVTp2 Parp2 EVT p1 Par p1 

Asymmetric Shifted Density 



EVT p2 Par p2 




EVT p1 Par p1 NP p1 EVT p2 Par p2 



Fig. 3. Box-plots of P(Q n AQ n )/p for the here proposed estimator and for the parametric 
and the nonparametric estimator, based on 100 simulated data sets of size 5,000 from the 
five presented densities for p — 1 / '5 ,000 (pi) and 1/10,000 (p2). 
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d 




Fig. 4. Estimator of 4> of bivariate exchange rate returns. 



we compute an approximate upper bound for the maximal difference of the 
5 estimators under the null hypothesis of equality: 0.264. Hence, there is no 
evidence that the 5 extreme value indices are different. Other exchange rate 
data sets share this property. There are also economic arguments supporting 
this claim. Therefore, we estimate a based on the radius and find a = 3.90. 
As a next step, we estimate the density ip of the spectral measure. The es- 
timate tp n is depicted in Figure 4; it is almost periodic with period ir. This 
yields that the boundary of the estimated extreme risk region is not like 
a circle, but more like an ellipse. The location of the maxima of ip n corre- 
spond to the major axis of the region. We estimate the extreme risk regions 
for p = 1/2,000,1/5,000 and 1/10,000; see Figure 5. For risk management 




Yen-Dollar 



Fig. 5. Estimated extreme risk regions of exchange rate returns. 
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of financial institutions in the U.S., it is important to know which extreme 
exchange rate returns w.r.t. the Pound and the Yen can occur and which 
returns essentially never occur. Our estimate answers this question. More 
specifically, points on the boundary of the estimated extreme risk region 
can be used as multivariate stress test scenarios. A scenario on the intersec- 
tion of the major axis of the ellipse-like boundary of the extreme risk region 
and the boundary itself corresponds to a larger shock than a scenario on the 
intersection of the minor axis of the ellipse-like boundary and the boundary 
itself, but our method shows that their "extremeness" is about the same. 

5. Proofs. For the proof of the theorem, we need several lemmas and 
propositions. We assume throughout that the conditions of the theorem are 
in force. We start with a lemma on regular variation in M. d . 

Lemma 1. Write 1 = 1/ /ri| z ||>n q( z ) dz. For any e > 0, 

/(**) 



(16) 

Moreover 
(17) 



lim sup 



t— >oo 



Z >£ 



lim 



t~ d V(t) 

xetB) 



q(z) 
q(z) dz 



0. 



i->oo V(t) Jb 

for any Borel set B bounded away from the origin. Define qt(z) =t(U(t)) d x 
f(U(t)z). Then 



(18) 



lim sup \qt(z) — lq( z )\ = 0. 



t— >oo 



ll z ll>£ 

Let h be the density of H , then 



(19) 



h(t) 



a 



lim 

t^oot^Vit) I 



Proof. For any |z| > e > [cf. Theorem 2.1 in de Haan and Resnick 
(1987)], 



t~ d V(t) 



q(z) 



/(t||z||(z/||z||)) (t||*||)-^(t||*||) 



{t\\z\\)- d V{t\\z\\) t- d V{t) 



< z 



/(i||z||(z/||z||)) 



(t\\z\\)- d V{t\\z\\) 



\z\\r d V{t\\z\\) 



(t\\z\\)~ d V(t\\z\\) 
Then (16) follows from condition (a). 



t~ d V(t) 
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Let a Borel set B be such that B C { ||z|| > 7} , for some 7 > 0. Then for z G 
B and sufficiently large t, f(tz)/t- d V(t) is bounded by gdlzll"^)!^!!^/ 2 ^. 
Hence, (17) holds by Lebesgue's dominated convergence theorem. 

We have from (17), as t — > 00, 

ty0 7 W )--M|L-H. 
v v " ¥(R>U(t)) 

Hence (16) implies, uniformly for ||z|| > e, 

»W = (y ' g ''»» (OT(»)) ^'' 1,1 - 

Note that 

f'OQ r 

1 _ H{t) = P(R > t) = / / /(rw) cZA(w)r d_1 dr. 
Jt Jo 

By taking derivatives, (16) and the homogeneity of q, we obtain 

} im + -u/u\ = / 9(w)dA(w)=a / q(z)dz = a/l. 

t^cot l V{t) J @ J{\H>i) D 

We now see that (5) and (6) hold with V = 1 — H . From now on, we will 
make the choice V = 1 — H and hence 1 = 1. Note that with this choice the 
relations (3) [with v(B) = f B q(z)dz] and (4) readily follow from (17). 

Corollary 1. For all Borel sets B with positive distance from the ori- 
gin, 

(20) lim tP(U(t)B) = v(B) 

t— >oo 

and 

n->oo p \ \k J \ np J J 

Proof. From F(R > U(t)) = l/t and (3), we obtain (20). It follows from 
(c) that 

(22 ) gMgVg) 

V ' U(n/k)(kv(S)/(np)) l / a 

This yields (21). □ 

Lemma 2. For each e > 0, there exists a 5 > and to > suc/i i/iai /or 



t- d v(t) 



< e } C {z: ||z|| > <5}. 
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PROOF. It is sufficient to prove {z: ||z|| < 6} C {z : f(tz)/(t~ d V(t)) > e}. 
First, by (6) and the continuity of q, for some c\ > 0, there exists so > 
such that for s > sq 

. /(sw) 
weo s" d y(s) ~ 

and also for si,S2 > so [cf- Proposition B. 1.9.5 in de Haan and Ferreira 
(2006)] 



V(si) > l(s 1 



-a/2 

V(S 2 ) " 2 V.S2, 

Now for t> so and any z £ {z : ||z|| < 5}, there are two possibilities, 
(i) t\\z\\ > so, then 

f(tz) /(t||z||(z/||z||)) (i||z||)-<V(i||z||) ^ 1 



t-'V(t) (t||z||)-^(t||z||) t~dV(t) 



(ii) t||z|| < so, then by continuity of / and / > 0, we have for some C2 > 0, 
f(tz) > C2, and hence, since lim^oo t~ d V(t) = 0, we obtain for t > to(> so) 

f(tz) 

> £. 



t~ d V(t) □ 



Lemma 3. For e > and large n, 



Q n CU[^){z:q(z)<l+e} 



and 



0„DC/^){z:g(z)<l- £ }. 

PROOF. Recall that Q n = {z : f(z) < (^y (n/fc)( J (n/fc))d }• It fol- 
lows from (22) that for n large enough and E\ > 



P / I V V P / J~\ k "( s )J (n/k)(U(n/k)) 



P J l y y yfc J J \ \ P 

( ^^{ Z : gi/(s)/p (z)<l + ei }. 



EXTREME RISK REGIONS 17 

Now Lemma 2 implies {z : q„(s)/p( z ) < 1 + £1} C {z : ||z|| > 5}, hence we have 
by (18) 

Q n cu(^-){z:q(z)<l + e}. 



P 

The other inclusion follows in the same way (but Lemma 2 is not needed). 

□ 

Lemma 4. For e > and large n, 

Q n Cf/^j{z:g(z)<l+ £ } 



and 



Q n DU[^-){z:q(z)<l-e}. 



Proof. Recall that Q n = *7(f)(^) 1/a {z : q ( z ) < 1}. 
Put T n = (C7(^))-l[/(| )(^) 1 /-, then 

Qn = (^) { T « z : <?( z ) < 1} = ^ (^) {^nz : g(T n z) < T~ d ~ a } 

= [/(^){z:,(z)<T-^}. 
Since T n — > 1 as n — > oo by (22), the result follows. □ 

Proposition 1. We have 

lin F(Q„AQ„) =0 

71— >00 p 

Proof. _Note that P(Q n A Q n ) < P(Q n _AQ n ) + P{Q_ n A Q n ). Observe 
that Q n C Qn or Q n C Q n , hence P(Qn A Q n ) <\p— P{Q n )\. By Lemma 3 
and Corollary 1, for any e > and large n 

U{S) -P{Q n ) < U ~^P (u ( ^1) { z : q(z) < 1 + e} 



p v v v v 

-H/({z:q(z) < 1 + e}) 

= v({z:q(z{l + e) l l^)<l}) 

= K{(l + e)" 1/(d+Q) z:9(z)<l}) 
= (l + e) a/{d+a) u(S). 

P(Qn) < (-i , F \a/(2+a) 



Thus, limsup^ ™ < (1 + eY 
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Similarly, we have lim inf > (1 - e) a ^ 2+a \ Hence, 

lim^oo = 1, that is, lim^oo P(Q " p AQ " ) = 0. 

In the same way, it follows from Lemmas 3 and 4 that 

V{S) -P{Q n *Q n )<^p(u(?W){*:l-e< ,(z) < 1 + e} 



P P \ \ P 

^v({z:l-£<q(z)<l + e}) 

= + e) Q /( d+a ) - (1 - e) Q /( d+Q )). 

Hence, lim^ P(0 '; A Q" } = 0. □ 

The following proposition shows uniform consistency of ip n and might 
be of independent interest. There is an abundant literature on density es- 
timation for directional data. In particular, uniform consistency of density 
estimators for directional data has been established in Bai, Rao and Zhao 
(1988). Here, however, the data do not have a fixed probability density on 0: 
the density ip is defined via a limit relation. Hence, tp is only an approximate 
model for the directional data. As a consequence, a more general result is 
required. 

Proposition 2. As n — > oo, 

sup \ip n (y) - ^( w )l ->■ 0- 

wee 

Proof. It is easy to see that, for any 77 > 0, there exists a function 



with 1 > ol\ > a?2 > • • • > ct m > and = ro < r\ < ■ • • < r m = 1, such that 

sup - if* (it) I <??• 

«6[0,1] 

Write Ui = 1 — H(Ri), i = 1, . . . ,n, and denote the corresponding order 
statistics with Ui- n . Let P be the probability measure on x (0,1) corre- 
sponding to (Wi,lq) and let P n be the empirical measure of the (Wj,f/j) 
i = 1, . . . , n. Define 



^) = —j—2^K I )l[fl i >ii„_ fc:n ] 

i=i 



and 



Cj(w) = nc{h > K) P^D^ x (0,C/ fc:n ]) 
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with D WJ ={v£ 6:1- hrj < w T v < 1 — hrj-i}. Observe that ipn( w ) = 
YljLi Q yY'nj( w )- Also write 

<Mw) = nc{h k K) P(D wJ x (0,U k:n ]). 
Let e > 0. It is sufficient to show that for large n 



(23) 



(24) P sup 
\ wee 



sup 

, wee 



V>n(w) -^ajVnj(w) 



i=i 



X>,ftMw) - C (/ l ,K)*p WJ )) 



i=i 



> 2e < 2e, 



> 2e < e, 



(25) 



sup 
wee 



<e. 



For wE0 and <5 G (0, 1), write C<5 = {C w (a) : w G 0, a < 5}. Note that, as 
n — > oo, 



(26) 



1 



ceCi,o<s<2 -Ho) 



-P(C x (0,sfc/ral) -s*(C) 



0. 



This readily follows from 



rP(Cx (0,8k/n]) = rn WeC,R>U( ^ 
k k \ \sk 

f(rw) 



k Ju(n/(sk))Jc r d V(r) 



d\{w)r- l V{r)dr 



and (16) and (19). 

Now we prove (23). It is easy to show that 



c{h,K) 

and hence 
(27) 

We have 

$„(w)-<(w)| 
c(/i, K) 



27r (d-i)/ 



r((d-i)/2) 



2 /•! 



A" 



1-t 



( l_ t 2 )( d-3)/2^ 



limsupc(/i, K)X(C w (h)) < oo. 



A: 



i=l 



1 - W J W,- 



1 - w 7 W, 
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c(h,K) 



(28) < 



n 



, / , ■ l -\W i £Cw(h),Ri>R n - k:n } 
i=l 



<v nc{h : K) p(c w (h)x(o,u k:n \) 



+ r] 



k 

nc(h, K) 

k 1 



[Pn-P)(C w (h) X (0,U k:n ] 



By (26), for r\ small enough the first term is less than e, with probability 
tending to one, uniformly in w G 0. Also, 



J"=l 



(29) 



< - V'nj(w) 



m 
3=1 



nc(h, K) 
k 



\(P n -P)(D wd x (0,U k:r 



From (29), (28) and (27), we see that for a proof of (23) it remains to show 
that 

-———sup sup |(P n -P)(C w (a/i)x(0,tf fc:n ])|40. 

KA^O w (n,JJ W £0O<a<l 

It can be shown that there exists a constant c = c{d) and finitely many w/ , 
I = 1, . . . , lh such that lh = 0(c(/i, K)) as h \, 0, and for every w£0 and 
0<a< 1 



C w (a/i) G C Wl (ch) for some L 



Hence for ei > 
ii 



sup sup |(P n - P)(C w (a/i) x (0,£/ fc:n ])| > e x 



weeo<a<i 



<P max sup sup \(P n - P)(C x (0,sfc/ra])| > £ifc/nA(C w (/i)) 

\l<l<lh CcC Wi (c/i) 0<s<2 

cec h 
+ P(C/ fe:n >2fc/n) 

<E P ( sup sup |(P„-P)(Cx (0,sk/n])\ >eifc/nA(C w (/i)) 



;=1 v CcC„ j (c/ l )0<s<2 

+ P(U k .. n >2k/n). 
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The latter probability tends to 0, so it suffices to consider the sum of the lh 
probabilities. Write 6 = Eik\(C w (h)). Fix I and define N = nP n (C W; (c/i) x 
(0,2/c/n]), fi = nP(C W; (ch) x (0,2/c/n]). Define the conditional probability 

measure P c = ^ on C W; (c/i) x (0, 2/c/n] and let P C}T be the corresponding 
empirical measure, based on r observations. We have 

sup sup n\(P n -P)(Cx (0,sk/n])\ >b 

CcC Wj (cfc)0<s<2 

cec h 

r=|#+6/3j 

< V P( sup sup n|(P n -P)(Cx (0,sk/n])\ >b\N = r 

r=^-b/3] K CcC mi (ch)0<s<2 

cec h 

x F(N = r) + P(|JV - Ml > V3) 



r=L/U+6/3j 
r=[ ^ m VccC W; (ch)0<s<2 



P„--p)(Cx(0, S fc/n]) 



b 

> - 
~ 2 



N = r )F(N = r) 



r=[p.+b/3j 

+ E 



sup sup n 

CcC Wl (cfc)0<s<2 
CdC h 



1 1 



P(C x (0,sk/n]) 



> 



N = r)¥(N = r) 



(30) 



+ F(\N-fi\ > 6/3) 
r=L^+6/3J 



< 



E 



sup sup r|(P c , r - P e )(C x (0,sA;/n])| > - 

r=r " /3l VCcC7 W; (ch)0<s<2 2 

x F(N = r) 

r=\jj,+b/3\ 



r-fi\>- )F{N = r) + F(\N - n\> 6/3). 



+ E 

r=\fi-b/3] 

Note that the first probability of the second sum in the right side of (30) 
is equal to 0. From Bennett's inequality [cf. Shorack and Wellner (1986), 
page 851], it follows that for some constant c\ 

F(\N -fi\> 6/3) < 2ex p(- £ ? c i^^y 
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Hence, since lh = 0(c(h,K)) 
h 



H\N - fx\ > 6/3) = O (c(h, K) exp (-efci 



o(l). 



To complete the proof of (23), we need to consider the first sum in the 
right side of (30). For the first probability in there, we use Corollary 2.9 in 
Alexander (1984), a good probability bound for empirical processes on VC 
classes. We obtain as an upper bound 



16 exp 



■-)■ 



Using r < /i + 6/3, we find for some constant C2 



l h r=[p,+b/3j 

E E 

1=1 r= |>-6/3l 



sup sup r\(P c , r -P C ){C x (0,sk/n])\ > - 

CcC w ,(c6)0<s<2 2 

cec h 



x ¥(N = r) 
l h r=LM+6/3j . 



k 



1=1 r = 0-6/31 



(■■■2 



c{h,K) 



P(JV = r) 



< 



16gexp(- e ?c 2 -^ 



i0 



ol). 



Next, we show (24). From (27) and (26), we obtain for £2 > small enough, 



sup 
we© 



j'=i 



sup 

we© 



aj-c(/i, if) (n/k?(D W j x (0,tf fc:f J) - *(£> wj )) 



<e 2 > a i c(/ i , J FC)A(C w (/i)) + sup 
^1 wee 



^a jC (/i,^)(nC/ fc:n ,/A;-l)^(B 



3=1 



n 



<£ + 

with probability tending to one. 



y"a j c(h,K)\{C w {h)) sup V(w) < 2e 
£1 we© 
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It remains to prove (25). It is readily seen that j c , h s K*{ 1 ^ v )dX(v 



YyjLi a j^(D-w,j)- Hence, for £3 > small enough 



sup 
wee 



c(h,K)Y, a i*(Dwj)-'<l>W 

3=1 



< sup ^(w) 
wee 



< sup if)(w 
we© 



J>;A(I> w j)-l 

3=1 



+ e 3 c(h, K) ^2 a jK D w,j) 
3=1 



f Cv/[h) K*((l-vr T -v)/h)d\(v) 



/^ (h) jr((l-w*v)/fc)dA(v) 

m 

+ e 3 c(h,K)X(C w (h))^2 aj 



3=1 



< rjc(h, K)X(C w (h)) sup V(w) + e 3 c(h, K)X(C w (h)) V aj 
wee 



3=1 



<£. 



□ 



From Proposition 2 and the consistency of a, we obtain immediately, as 

n — > 00, 



1/(5) 4 u(S) 



and, for e > 0, 
(31) 



P((l + e)Sc5c(l-e)S)->l. 

Propositions. ylsn — »oo, 

P(Q n AQ n ) 



0. 



P 



Proof. Note that as n — > 00, we have 



C7 



'17 



n \ p 



1. 



HS)) 1 / 3 ^ (^(5)) 1 / Q , 

^, ^ l/S-l/a 



/)/) 



exp 



\fk(a — a) ( log A; log(np) \ 
Vk Vk J 



aa 



Combining these three limit relations, we obtain 

U{n/k)(k'i^S)/(n P )) 1 ^ w 
Uin/kjiku^/inp)) 1 /* ~* 
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This and (31) yields that with probability tending to one, as n — > oo, 
(l + e) 2 Q n cQ n C(l-e) 2 Q n . 



Then, 



P ^TUJl^rJ ((i-ema+^s) 

and, by (21), the latter expression tends to 
u((l-s) 2 S\(l + s) 2 S)/u(S) 

= u((l-e) 2 S)/u(S)-u((l + e) 2 S)/u(S) 

= (l-e)~ 2a -(l + e)- 2a , 
which in turn tends to 0, as e 4- 0. □ 

PROOF of Theorem 1. The result follows from Propositions 1 and 3. □ 
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